AITopics | final layer


final layer


Beyond the Surface: Enhancing LLM-as-a-Judge Alignment with Human via Internal Representations

Neural Information Processing Systems, Jun-13-2026, 03:22:29 GMT

The growing scale of evaluation tasks has led to the widespread adoption of automated evaluation using LLMs, a paradigm known as "LLM-as-a-judge". However, improving its alignment with human preferences without complex prompts or fine-tuning remains challenging. Previous studies mainly optimize based on shallow outputs, overlooking rich cross-layer representations. In this work, motivated by preliminary findings that middle-to-upper layers encode semantically and task-relevant representations that are often more aligned with human judgments than the final layer, we propose LAGER, a post-hoc, plug-and-play framework for improving the alignment of LLM-as-a-Judge point-wise evaluations with human scores by leveraging internal representations. LAGER produces fine-grained judgment scores by aggregating cross-layer score-token logits and computing the expected score from a softmax-based distribution, while keeping the LLM backbone frozen and ensuring no impact on the inference process.
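The scoring step described in the abstract (aggregate score-token logits across layers, apply a softmax, take the expected score) can be sketched as follows. This is a minimal illustration, not the LAGER implementation: the function names, the weighted-average aggregation, and the layer weights are assumptions, since the abstract does not say how layers are combined or how per-layer logits are obtained.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def expected_score(layer_logits, layer_weights, scores):
    """Return a fine-grained score from per-layer score-token logits.

    layer_logits[l][k] is the logit of the token for scores[k] at layer l.
    layer_weights are hypothetical non-negative weights, one per layer;
    how they are chosen is an assumption of this sketch.
    """
    weight_sum = sum(layer_weights)
    norm = [w / weight_sum for w in layer_weights]
    # Weighted average of the logits across layers, one value per score token.
    aggregated = [
        sum(w * logits[k] for w, logits in zip(norm, layer_logits))
        for k in range(len(scores))
    ]
    probs = softmax(aggregated)
    # Expected score under the softmax distribution over candidate scores.
    return sum(p * s for p, s in zip(probs, scores))


# Two layers, candidate scores 1 to 5, equal layer weights.
logits = [
    [0.0, 0.0, 1.0, 2.0, 0.5],
    [0.0, 0.5, 1.5, 2.5, 1.0],
]
print(expected_score(logits, [1.0, 1.0], [1, 2, 3, 4, 5]))
```

Because the result is an expectation rather than an argmax, it can fall between integer scores, which is what makes the judgment fine-grained.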

large language model, machine learning, natural language, (9 more...)


Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.98)
Information Technology > Artificial Intelligence > Machine Learning (0.39)


Our experimental evaluations demonstrate that this simple modification significantly improves the quality of localization maps on both the PASCAL VOC 2012 and MS COCO 2014 datasets, exhibiting a new state-of-the-art performance for weakly supervised semantic segmentation.

artificial intelligence, machine learning, segmentation, (16 more...)

Neural Information Processing Systems

Country: Asia > South Korea > Seoul > Seoul (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)


1f88c7c5d7d94ae08bd752aa3d82108b-Supplemental.pdf

Neural Information Processing Systems, Feb-7-2026, 18:54:20 GMT

kernel size, pixel cnn, residual block, (11 more...)


Technology: Information Technology > Artificial Intelligence > Machine Learning (0.52)


where $\ell = 1, 2, \dots, L$ is the number of hidden layers ($\psi^{(1)}(r_i) = \psi(r_i)$ and $L$ is the final layer), ReLU is the nonlinear activation function, $W^{(\ell)} \in \mathbb{R}^{N \times N}$ is the weight matrix in layer $\ell$, and b

Neural Information Processing Systems, Feb-7-2026, 14:13:37 GMT

These molecular properties were calculated using a hybrid quantum simulation (Gaussian 09) at the B3LYP/6-31G(2df,p) level of theory. In this study, we created a subset of the QM9 dataset with a limited number of atoms, M 14, per molecule, which we refer to as the "QM9under14atoms" dataset in the main text. As the learning/predicting targets, we selected three kinds of energy properties: atomization energy at 0 K, zero point vibrational energy, and enthalpy at 298.15 K. $\in \mathbb{R}^{N}$ is the bias vector in layer $\ell$. The LCAO considers the normalization for the coefficients in Eq. (6) in the main text. Additionally, the normalization term in Eq. (7) in the main text is calculated as follows: $Z(q_n, \zeta_n) =$

artificial intelligence, machine learning, nonlinear activation function, (10 more...)


Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.07)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)


0525fa17a8dbea687359116d01732e12-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-7-2026, 07:07:34 GMT

fourier transform, regularisation, training data, (16 more...)


Technology: Information Technology > Artificial Intelligence (0.31)


Layer Probing Improves Kinase Functional Prediction with Protein Language Models

Kumar, Ajit, Jha, IndraPrakash

arXiv.org Artificial Intelligence, Dec-2-2025

Protein language models (PLMs) have transformed sequence-based protein analysis, yet most applications rely only on final-layer embeddings, which may overlook biologically meaningful information encoded in earlier layers. We systematically evaluate all 33 layers of ESM-2 for kinase functional prediction using both unsupervised clustering and supervised classification. We show that mid-to-late transformer layers (layers 20-33) outperform the final layer by 32 percent in unsupervised Adjusted Rand Index and improve homology-aware supervised accuracy to 75.7 percent. Domain-level extraction, calibrated probability estimates, and a reproducible benchmarking pipeline further strengthen reliability. Our results demonstrate that transformer depth contains functionally distinct biological signals and that principled layer selection significantly improves kinase function prediction.
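A layer-selection loop of the kind the abstract describes (score every layer's clustering against known labels, then pick the best layer) might look like the sketch below. It is illustrative only: the function names, the toy labels, and the layer numbers are hypothetical, the cluster assignments are assumed to have been computed elsewhere from each layer's embeddings, and the Adjusted Rand Index is written out in plain Python rather than taken from the authors' pipeline.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand Index between two labelings of the same items."""
    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: treat as trivially identical
    # Pair counts within contingency cells, true classes, and predicted clusters.
    same_both = sum(comb(c, 2) for c in Counter(zip(labels_true, labels_pred)).values())
    same_true = sum(comb(c, 2) for c in Counter(labels_true).values())
    same_pred = sum(comb(c, 2) for c in Counter(labels_pred).values())
    expected = same_true * same_pred / total_pairs
    max_index = (same_true + same_pred) / 2
    if max_index == expected:
        return 1.0  # degenerate case, e.g. a single cluster in both labelings
    return (same_both - expected) / (max_index - expected)


def best_layer(labels_true, clusters_by_layer):
    """Score each layer's cluster assignment and return (best_layer, scores)."""
    scores = {
        layer: adjusted_rand_index(labels_true, pred)
        for layer, pred in clusters_by_layer.items()
    }
    return max(scores, key=scores.get), scores


# Toy example: hypothetical cluster assignments for two layers.
families = ["A", "A", "B", "B"]
clusters = {24: [1, 1, 0, 0], 33: [0, 1, 0, 1]}
layer, scores = best_layer(families, clusters)
print(layer, scores)
```

In this toy input the layer-24 clusters match the families exactly (ARI of 1.0) while the layer-33 clusters split every family (ARI of -0.5), so layer 24 is selected.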

classification, machine learning, natural language, (14 more...)


2512.00376

Country: Asia > India > NCT (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)


Iterative Inference in a Chess-Playing Neural Network

Sandmann, Elias, Lapuschkin, Sebastian, Samek, Wojciech

arXiv.org Artificial Intelligence, Nov-26-2025

Do neural networks build their representations through smooth, gradual refinement, or via more complex computational processes? We investigate this by extending the logit lens to analyze the policy network of Leela Chess Zero, a superhuman chess engine. Although playing strength and puzzle-solving ability improve consistently across layers, capability progression occurs in distinct computational phases with move preferences undergoing continuous reevaluation: move rankings remain poorly correlated with final outputs until late, and correct puzzle solutions found in middle layers are sometimes overridden. This late-layer reversal is accompanied by concept preference analyses showing final layers prioritize safety over aggression, suggesting a mechanism by which heuristic priors can override tactical solutions.
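The claim that move rankings stay poorly correlated with the final output until late layers can be made concrete with a per-layer rank correlation. The sketch below assumes that a logit-lens readout has already produced one vector of move logits per layer (how that readout is done for Leela Chess Zero is not shown here), and it uses Spearman correlation under the simplifying assumption of no tied logits; the function names and toy numbers are hypothetical.

```python
def ranks(values):
    """Rank positions (0 = highest logit); assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0] * len(values)
    for rank, idx in enumerate(order):
        result[idx] = rank
    return result


def spearman(a, b):
    """Spearman rank correlation for equal-length, tie-free sequences (n >= 2)."""
    n = len(a)
    d2 = sum((x - y) ** 2 for x, y in zip(ranks(a), ranks(b)))
    return 1 - 6 * d2 / (n * (n * n - 1))


def rank_correlation_by_layer(layer_logits):
    """Correlate each layer's move ranking with the final layer's ranking."""
    final = layer_logits[-1]
    return [spearman(logits, final) for logits in layer_logits]


# Toy example: three layers, three candidate moves.
layer_logits = [
    [0.1, 0.2, 0.3],  # early layer: ranking reversed relative to the output
    [0.3, 0.1, 0.2],  # middle layer: top move agrees, the rest do not
    [0.3, 0.2, 0.1],  # final layer
]
print(rank_correlation_by_layer(layer_logits))  # [-1.0, 0.5, 1.0]
```

A curve that stays low and rises only near the last entries would correspond to the late convergence the abstract reports.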

artificial intelligence, machine learning, natural language, (17 more...)


2508.2138

Country: Europe (0.27)

Genre: Research Report > New Finding (0.92)

Industry: Leisure & Entertainment > Games > Chess (1.00)

Technology: